Apparatus and method for managing telepresence sessions

ABSTRACT

A system that incorporates teachings of the present disclosure may include receiving first and second media content portraying images of first and second users, wherein the first and second users participate in a conference call. Joint media content of the conference call is generated including the images of the first and second users, such that, when presented at a display of a third user, it includes the images of the first and second users. Speech of the first user directed to the second user is detected and, in response, the image of the first user in the joint media content is adjusted to obtain a rotated image of the first user. The joint media content includes the rotated image of the first user and the image of the second user, such that, when presented at the display of the third user, it portrays the first user turned toward the second user.

CROSS-REFERENCE TO RELATED APPLICATIONS

This application is a Continuation of and claims priority to U.S. patent application Ser. No. 14/856,920, filed Sep. 17, 2015, which is a Continuation of and claims priority to U.S. patent application Ser. No. 14/568,844, filed Dec. 12, 2014, which is a Continuation of and claims priority to U.S. patent application Ser. No. 13/173,281, filed Jun. 30, 2011, now issued as U.S. Pat. No. 8,947,497. The contents of each of the foregoing are hereby incorporated by reference into this application as if set forth herein in full.

FIELD OF THE DISCLOSURE

The present disclosure relates generally to communication and more specifically to an apparatus and method for managing telepresence sessions.

BACKGROUND

Media consumption has become a multibillion dollar industry that continues to grow rapidly. High resolution displays such as high definition televisions and high resolution computer monitors can present two-dimensional movies and games with three-dimensional perspective. Collectively, improvements in viewing, audio, and communication technologies are driving demand for consumption of all types of media content. Individuals often desire to share their experiences, including with respect to media consumption, products and services. The sharing of these experiences is often limited by the capabilities of the communication devices being utilized for messaging and the like, as well as by factors that are independent of those devices.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 depicts an illustrative embodiment of a communication system that provides media services with telepresence;

FIG. 2 depicts an illustrative embodiment of a presentation device and media processor for presenting media content that can be used in the system of FIG. 1;

FIG. 3 depicts an illustrative embodiment of a viewing apparatus that can be used with FIG. 2;

FIG. 4 depicts an illustrative embodiment of a presentation device with a polarized display that can be used in the system of FIG. 1;

FIGS. 5-7 depict illustrative embodiments of communication systems that provide media services with telepresence;

FIG. 8 depicts an illustrative embodiment of a method operating in portions of the devices and systems of FIGS. 1-7;

FIG. 9 depicts an illustrative embodiment of a communication system that provides media services with telepresence;

FIG. 10 depicts an illustrative embodiment of a method operating in portions of the devices, systems and/or methods of FIGS. 1-9; and

FIG. 11 is a diagrammatic representation of a machine in the form of a computer system within which a set of instructions, when executed, may cause the machine to perform any one or more of the methodologies discussed herein.

DETAILED DESCRIPTION

The present disclosure describes, among other things, illustrative embodiments of methods and devices for synchronizing telepresence sessions among a plurality of users. The telepresence sessions present media content, as well as video content of the users, which simulate a co-location of the users at each of the other user locations. In one or more embodiments, latency areas of a network can be detected and routes can be configured for the telepresence sessions based on the latency area. In one or more embodiments, a delay can be injected into delivery and/or presentation of one or both of the media content and the video content for select user locations to synchronize the telepresence configurations. In one or more embodiments, loopback testing can be performed to determine the latency area and/or a time period for delaying the delivery and/or presentation of one or both of the media content and the video content for select user locations. In one or more embodiments, dedicated routes can be provided for telepresence sessions, such as based on a service upgrade and/or a history of utilization of telepresence sessions. This application is related to U.S. patent application Ser. No. 13/173,310, entitled “Apparatus and Method for Providing Media Content” by Hines et al., now issued as U.S. Pat. No. 9,030,522, co-pending U.S. patent application Ser. No. 13/168,539, filed on Jun. 24, 2011, entitled “Apparatus and Method for Presenting Media Content with Telepresence” by Hines et al., and co-pending U.S. patent application Ser. No. 13/168,549, filed on Jun. 24, 2011, entitled “Apparatus and Method for Presenting Three Dimensional Objects with Telepresence” by Hines et al. The contents of each of the foregoing are hereby incorporated by reference into this application as if set forth herein in full.

In one or more embodiments, the media content and/or the video content, or a portion thereof, can be presented as three dimensional (3D) content to enhance the telepresence. In one or more embodiments, the 3D content can be generated by a remote server and/or can be generated by each media processor, such as through use of a depth map. In one or more embodiments, 3D cameras and/or a plurality of two dimensional cameras at each user location can be utilized for generating 3D content. In one or more embodiments, images of a user can be rotated or otherwise repositioned during presentation in response to detecting speech of the user to further enhance the telepresence by simulating the user facing another user to speak. Other embodiments are also included herein.

One embodiment of the present disclosure can include a server that includes a memory and a controller coupled to the memory. The controller can be adapted to receive a request for a telepresence session between first and second media processors, where the first media processor is located at a first location of a first user, and where the second media processor is located at a second location of a second user. The telepresence session can include providing media content and video content over a network for presentation in a telepresence configuration at a first display device of the first location and at a second display device of the second location. The telepresence configuration simulates the first user being present at the second location and simulates the second user being present at the first location. The controller is adapted to determine a latency area of the network based on latency testing and configure routes over the network for the telepresence session based on the latency area. The controller is adapted to determine latency associated with the presentation of the telepresence configuration at the first location and delay presentation of at least one of the media content and the video content at the second location based on the determined latency associated with the presentation of the telepresence configuration at the first location. At least one of the media content and the video content are presented as three dimensional content at the first display device.

One embodiment of the present disclosure can include a method that includes obtaining first images that are captured by a first camera system at a first location associated with a first user and transmitting first video content representative of the first images over a network for presentation by a second media processor at a second location associated with a second user. The method includes receiving over the network at a first media processor of the first location, media content and second video content representative of second images that are associated with the second user. The method includes presenting at a first display device of the first location, the media content and the second video content in a first telepresence configuration that simulates a presence of the second user at the first location. The media content and the first video content can be adapted for presentation by the second media processor in a second telepresence configuration that simulates a presence of the first user at the second location. The presentation of at least one of the media content and the second video content at the first display device can be delayed based on latency parameters associated with the presentation of the second telepresence configuration by the second media processor, thereby synchronizing the first and second telepresence configurations.

One embodiment of the present disclosure can include a non-transitory computer-readable storage medium that includes computer instructions. The instructions can enable obtaining media content at a server. The instructions can enable receiving over a network at the server, first video content of a first user at a first location. The instructions can enable receiving over the network at the server, second video content of a second user at a second location. The instructions can enable receiving over the network at the server, third video content of a third user at a third location. The instructions can enable transmitting over the network from the server, the media content and the second and third video content to a first media processor for presentation in a first telepresence configuration at a first display device that simulates the second and third users being present at the first location. The instructions can enable transmitting over the network from the server, the media content and the first and third video content to a second media processor for presentation in a second telepresence configuration at a second display device that simulates the first and third users being present at the second location. The instructions can enable transmitting over the network from the server, the media content and the first and second video content to a third media processor for presentation in a third telepresence configuration at a third display device that simulates the first and second users being present at the third location. The instructions can enable synchronizing the presentations of the first, second and third telepresence configurations by determining a largest latency among a select one of the presentations of the first, second and third telepresence configurations and delaying the presentations of the remaining others of the first, second and third telepresence configurations based on the determined largest latency.
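The synchronization logic recited above reduces to delaying every presentation to match the slowest one. The following is a minimal sketch of that computation in Python; the function name, location labels, and sample latency values are illustrative assumptions, not part of the disclosure.

```python
def compute_presentation_delays(latencies_ms: dict) -> dict:
    """Return the delay (in ms) to inject at each location so that all
    telepresence presentations align with the largest observed latency."""
    largest = max(latencies_ms.values())
    return {location: largest - latency for location, latency in latencies_ms.items()}

# Hypothetical measured latencies for three locations.
latencies = {"location_1": 80.0, "location_2": 35.0, "location_3": 50.0}
print(compute_presentation_delays(latencies))
# -> {'location_1': 0.0, 'location_2': 45.0, 'location_3': 30.0}
```

The location with the largest latency receives no added delay; every other location is delayed by the difference, which is the behavior described above.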

FIG. 1 depicts an illustrative embodiment of a first communication system 100 for delivering media content, which can include 3D media content. System 100 provides for synchronizing telepresence sessions among a plurality of users at different locations to simulate a co-location of the users at each of the other user locations. System 100 provides for the detection of latency areas of a network where routes can be configured for the telepresence sessions based on the latency area. System 100 also provides for injection of a delay into delivery and/or presentation of one or both of the media content and the video content for select user locations to synchronize the telepresence configurations.

The communication system 100 can represent an Internet Protocol Television (IPTV) broadcast media system, although other media broadcast systems can be utilized by the present disclosure. The IPTV media system can include a super head-end office (SHO) 110 with at least one super head-end office server (SHS) 111 which receives media content from satellite and/or terrestrial communication systems. In the present context, media content can represent audio content, moving image content such as videos, still image content, or combinations thereof. The SHS server 111 can forward packets associated with the media content to video head-end servers (VHS) 114 via a network of video head-end offices (VHO) 112 according to a common multicast communication protocol.

The VHS 114 can distribute multimedia broadcast programs via an access network 118 to commercial and/or residential buildings 102 housing a gateway 104 (such as a residential or commercial gateway). The access network 118 can represent a group of digital subscriber line access multiplexers (DSLAMs) located in a central office or a service area interface that provide broadband services over optical links or copper twisted pairs 119 to buildings 102. The gateway 104 can use common communication technology to distribute broadcast signals to media processors 106 such as computers, Set-Top Boxes (STBs) or gaming consoles which in turn present broadcast channels to display devices 108 such as television sets or holographic display devices, managed in some instances by a media controller 107 (such as an infrared or RF remote control, gaming controller, etc.).

The gateway 104, the media processors 106, and/or the display devices 108 can utilize tethered interface technologies (such as coaxial, phone line, or powerline wiring) or can operate over a common wireless access protocol such as Wireless Fidelity (WiFi). With these interfaces, unicast communications can be invoked between the media processors 106 and subsystems of the IPTV media system for services such as video-on-demand (VoD), browsing an electronic programming guide (EPG), or other infrastructure services.

Some of the network elements of the IPTV media system can be coupled to one or more computing devices 130, where a portion of these computing devices can operate as a web server for providing portal services over an Internet Service Provider (ISP) network 132 to media processors 106, wireline display devices 108 or wireless communication devices 116 (e.g., cellular phone, laptop computer, etc.) by way of a wireless access base station 117 operating according to common wireless access protocols such as WiFi, or cellular communication technologies (such as GSM, CDMA, UMTS, WiMAX, Software Defined Radio or SDR, and so on).

A satellite broadcast television system can be used in conjunction with, or in place of, the IPTV media system. In this embodiment, signals transmitted by a satellite 115 carrying media content can be intercepted by a common satellite dish receiver 131 coupled to the building 102. Modulated signals intercepted by the satellite dish receiver 131 can be transferred to the media processors 106 for decoding and distributing broadcast channels to the display devices 108. The media processors 106 can be equipped with a broadband port to the IP network 132 to enable services such as VoD and EPG described above.

In yet another embodiment, an analog or digital broadcast distribution system such as cable TV system 133 can be used in place of or in conjunction with the IPTV media system described above. In this embodiment the cable TV system 133 can provide Internet, telephony, and interactive media services.

The present disclosure can apply to any present or next generation over-the-air and/or landline media content services system. In one embodiment, an IP Multimedia Subsystem (IMS) network architecture can be utilized to facilitate the combined services of circuit-switched and packet-switched systems in delivering the media content to one or more viewers.

System 100 can provide 3D content to the building 102 for presentation and/or can provide 2D content that can be rendered into 3D content by one or more client devices, such as the media processor 106 or the TV 108. The 3D image content can be based upon various 3D imaging techniques, including polarization, anaglyphics, active shuttering (such as alternate frame sequencing), autostereoscopy, and so forth. The present disclosure can include presentation of all or a portion of a display in 3D, including utilizing devices that do not require a wearable viewing apparatus (e.g., does not require active shuttering glasses).

In one embodiment, system 100 can include one or more image capturing devices 175 (e.g., a camera) that can capture 2D and/or 3D images of a user and/or other objects at the building 102. Other components can be utilized in combination with or in place of the camera 175, such as a scanner (e.g., a laser system that detects object circumference), distance detector, and so forth. In one embodiment, camera 175 can be a group of cameras, such as two or more cameras for providing different viewing angles and/or for providing a holographic image. In one embodiment, the camera 175 can capture images in 2D which are processed into 3D content, such as by media processor 106 and/or computing device 130. In one embodiment, depth maps can be utilized to generate 3D content from 2D images. In another embodiment, the camera 175 can be a stereoscopic camera that directly captures 3D images, such as through use of multiple lenses. A collector 176 or other component can facilitate the processing and/or transmission of the captured images. The collector 176 can be a stand-alone device, such as in communication with the media processor 106 and/or the gateway 104 (e.g., wirelessly and/or hardwired communication) or can be integrated with another device, such as the media processor 106.
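Where the disclosure notes that depth maps can be utilized to generate 3D content from 2D images, one common approach is depth-image-based rendering: each pixel is shifted horizontally by a disparity proportional to its depth to synthesize left- and right-eye views. The sketch below, in Python with NumPy, is one illustrative way to do this, not the disclosed method; the disparity scale is an assumed tuning parameter.

```python
import numpy as np

def render_stereo_pair(frame: np.ndarray, depth: np.ndarray,
                       max_disparity: int = 16):
    """frame: HxWx3 uint8 image; depth: HxW floats in [0, 1] (1.0 = nearest).
    Returns (left, right) views synthesized by horizontal pixel shifts."""
    h, w = depth.shape
    disparity = (depth * max_disparity).astype(int)
    cols = np.arange(w)
    left = np.zeros_like(frame)
    right = np.zeros_like(frame)
    for y in range(h):
        # Near pixels shift outward in opposite directions for each eye.
        left[y, np.clip(cols + disparity[y], 0, w - 1)] = frame[y, cols]
        right[y, np.clip(cols - disparity[y], 0, w - 1)] = frame[y, cols]
    return left, right
```

A production renderer would additionally fill the holes left by disoccluded regions (e.g., by inpainting), which this sketch omits.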

Computing device 130 can also include computer readable storage medium 180 having computer instructions for establishing a telepresence communication session between client devices. The computing device 130 can provide media content to a number of different users at different locations, such as a user at building 102, via the telepresence communication session. Computing device 130 can provide the media content in a telepresence configuration that simulates each of the other users (not shown) being present at building 102. For instance, the telepresence configuration can display the media content and further display each of the other users to simulate them watching the media content. In one embodiment, the particular telepresence configuration can be adjusted by one or more of the users based on user preferences, such as retrieved from a user profile or determined from monitored viewing behavior.

In one or more embodiments, the storage medium 180 can include computer instructions for determining a latency area of the access network 118 and can configure routes for the telepresence session based on the latency area, such as avoiding use of one or more network elements of the latency area. In one or more embodiments, the storage medium 180 can include computer instructions for determining a delay to inject or otherwise provide to the delivery and/or presentation of the telepresence configuration to select locations in order to synchronize the telepresence sessions among locations. For example, the computing device 130 can delay delivery of the video content and/or the media content (e.g., a unicast or multicast of the media content) to a number of locations so that those locations can be synchronized with another location that is experiencing latency.
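A minimal sketch of the route-configuration idea follows, assuming the network is modeled as a weighted graph and the latency area is a set of network elements to avoid; the graph representation and names are assumptions, not the disclosed implementation.

```python
import heapq

def route_avoiding_latency_area(graph: dict, src: str, dst: str,
                                latency_area: set):
    """graph: {node: {neighbor: cost_ms}}. Returns the lowest-cost path
    from src to dst that avoids every node in latency_area, or None."""
    queue = [(0.0, src, [src])]
    visited = set()
    while queue:
        cost, node, path = heapq.heappop(queue)
        if node == dst:
            return path
        if node in visited:
            continue
        visited.add(node)
        for neighbor, edge_cost in graph.get(node, {}).items():
            if neighbor not in latency_area and neighbor not in visited:
                heapq.heappush(queue, (cost + edge_cost, neighbor, path + [neighbor]))
    return None
```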

In one or more embodiments, the media content and/or the images of the users, or a portion thereof, can be presented as 3D content to enhance the telepresence. For example, the 3D content can be generated by computing device 130 and/or can be generated by media processor 106, such as through use of a depth map in combination with the corresponding images. System 100 can include other components to enhance the telepresence experience. For instance, lighting and audio components can be utilized to facilitate capturing the images and audio from a user. The lighting and/or audio components can be controlled by the media processor 106 and/or by the computing device 130. User preferences and/or monitored behavior can be utilized in controlling the lighting and/or audio components.

In one embodiment, the users can be part of a social network and the computing device 130 can be in communication with a social network application, such as for selecting the media content to be provided in the telepresence configuration. In one embodiment, one of the media processors 106 can maintain control over presentation of the media content in the telepresence configuration, such as pause, fast-forward, rewind, size, resolution, and so forth. In one embodiment, the telepresence configuration, including providing the media content and the video content of each of the users, can be performed without using the computing device 130 to generate the video content from captured images or to combine the media and video content. In one example, the telepresence configuration can be generated by the media processors and distributed through a peer-to-peer technique, where the media processors share the video content amongst themselves and obtain the media content from one of the media processors or from another source, such as media content being broadcast. In one embodiment, each of the media processors 106 of the different users can be in a master-slave arrangement to control presentation of the media content and facilitate generating the telepresence configuration.

System 100 enables video and/or audio content of the users to be provided to the other users in real-time to establish a communication session while simulating the co-location of the users and providing telepresence with the media content.

FIG. 2 depicts an illustrative embodiment of a presentation device 202 and the media processor 106 for presenting a telepresence configuration 210 that can include video content 225, which is captured images of one or more other users that are at different locations from where the presentation device 202 is located. The telepresence configuration 210 can also include the media content 250. The telepresence configuration 210 can simulate the other users being present at the location of the presentation device 202 through use of the video content 225. The simulation can be performed in a number of different ways, including presenting the other users in the images as if they were viewing the media content. The simulation can be facilitated by the positioning of the camera 175 and/or by post-capture processing, such as adjusting the video content 225 so that the other users appear as being rotated towards the media content 250. Other simulation effects can be utilized. For example, the images in the video content 225 can be re-sized, including based on the particular size of the presentation device 202, to further simulate the other users being present at the location of the presentation device 202. The media content 250 and/or video content 225 of one or more users can be provided for presentation in the telepresence configuration 210 in 3D.

One or both of the presentation device 202 and the media processor 106 can include the camera 175 that captures images of the user that are provided to the other users in their telepresence configuration 210. The camera 175 can capture 2D images and/or can capture 3D images. The camera 175 can be a group of cameras to capture multiple views, including views to construct a holographic image, such as of the user and/or of objects associated with the user. In one embodiment, the presentation device 202 can be a holographic display device that presents all or a portion of the telepresence configuration 210 as holographic content. The holographic content can allow a viewer's perspective on a depicted object to change as the viewer moves around the hologram content, just as it would if the object were real.

In the present illustration, the presentation device 202 is depicted as a television set. It will be appreciated that the presentation device 202 can represent a portable communication device such as a cellular phone, a PDA, a computer, or other computing device with the ability to display media content. The media processor 106 can be an STB, or some other computing device such as a cellular phone, computer, gaming console, or other device that can process and direct the presentation device 202 to present images associated with media content. It is further noted that the media processor 106 and the presentation device 202 can be an integral unit. For example, a computer or cellular phone having computing and display resources collectively can represent the combination of a presentation device 202 and media processor 106.

The media processor 106 can be adapted to communicate with accessories such as the viewing apparatus 300 of FIG. 3 by way of a wired or wireless interface, such as through RF and/or light waves 206. The communication can be one-way and/or two-way communication, such as providing the viewing apparatus 300 with a transceiver 302. A wired interface can represent a tethered connection from the viewing apparatus 300 to an interface of the media processor (e.g., USB or proprietary interface). A wireless interface can represent a radio frequency (RF) interface such as Bluetooth, WiFi, Zigbee or other wireless standard. The wireless interface can also represent an infrared communication interface. Any standard or proprietary wireless interface between the media processor 106 and the viewing apparatus 300 can be utilized by the present disclosure.

The viewing apparatus 300 can represent an apparatus for viewing two-dimensional and/or 3D stereoscopic images which can be still or moving images. The viewing apparatus 300 can be an active shutter viewing apparatus. In this embodiment, each lens has a liquid crystal layer which can be darkened or made to be transparent by the application of one or more bias voltages. Each lens 304, 306 can be independently controlled. Accordingly, the darkening of the lenses can alternate, or can be controlled to operate simultaneously.

Each viewing apparatus 300 can include various components associated with a communication device including a wireline and/or wireless transceiver 302 (herein transceiver 302), a user interface (UI), a power supply, a location detector, and a controller 307 for managing operations thereof. The transceiver 302 can support short-range or long-range wireless access technologies such as infrared, Bluetooth, WiFi, Digital Enhanced Cordless Telecommunications (DECT), or cellular communication technologies, just to mention a few. Cellular technologies can include, for example, CDMA-1X, UMTS/HSDPA, GSM/GPRS, TDMA/EDGE, EV/DO, WiMAX, SDR, and next generation cellular wireless communication technologies as they arise. The transceiver 302 can also be adapted to support circuit-switched wireline access technologies (such as PSTN), packet-switched wireline access technologies (such as TCP/IP, VoIP, etc.), and combinations thereof.

The UI can include a depressible or touch-sensitive keypad with a navigation mechanism such as a roller ball, joystick, mouse, or navigation disk for manipulating operations of the communication device 300. The keypad can be an integral part of a housing assembly of the apparatus 300 or an independent device operably coupled thereto by a tethered wireline interface (such as a USB cable) or a wireless interface supporting for example Bluetooth. The keypad can represent a numeric dialing keypad commonly used by phones, and/or a Qwerty keypad with alphanumeric keys. The UI can further include a display such as monochrome or color LCD (Liquid Crystal Display), OLED (Organic Light Emitting Diode) or other suitable display technology for conveying images to an end user of the apparatus 300. In an embodiment where the display is touch-sensitive, a portion or all of the keypad 308 can be presented by way of the display.

The UI can also include an audio system 312 that utilizes common audio technology for conveying low volume audio (such as audio heard only in the proximity of a human ear) and high volume audio for hands free operation. The audio system 312 can further include a microphone for receiving audible signals of an end user. The audio system 312 can also be used for voice recognition applications. The UI can further include an image sensor such as a charged coupled device (CCD) camera for capturing still or moving images.

The power supply can utilize common power management technologies such as replaceable and rechargeable batteries, supply regulation technologies, and charging system technologies for supplying energy to the components of the apparatus 300 to facilitate long-range or short-range portable applications. The location detector can utilize common location technology such as a global positioning system (GPS) receiver for identifying a location of the communication device 300 based on signals generated by a constellation of GPS satellites, thereby facilitating common location services such as navigation.

The transceiver 302 can also determine a proximity to a cellular, WiFi or Bluetooth access point by common power sensing techniques such as utilizing a received signal strength indicator (RSSI) and/or a signal time of arrival (TOA) or time of flight (TOF). The controller 307 can utilize computing technologies such as a microprocessor, a digital signal processor (DSP), and/or a video processor with associated storage memory such as Flash, ROM, RAM, SRAM, DRAM or other storage technologies.
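As one illustration of RSSI-based proximity sensing, the widely used log-distance path-loss model estimates distance from received power; the reference power at one meter and the path-loss exponent below are environment-dependent assumptions, not values from the disclosure.

```python
def estimate_distance_m(rssi_dbm: float, tx_power_dbm: float = -59.0,
                        path_loss_exponent: float = 2.0) -> float:
    """Estimate distance (meters) from RSSI using the log-distance model.
    tx_power_dbm is the expected RSSI at a 1 m reference distance."""
    return 10 ** ((tx_power_dbm - rssi_dbm) / (10.0 * path_loss_exponent))

# Example: -75 dBm with these parameters suggests roughly 6.3 m.
print(round(estimate_distance_m(-75.0), 1))
```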

In one embodiment, the viewing apparatus 300 can utilize a receiver portion of the transceiver 302 in the form of an infrared receiver. Alternatively, the viewing apparatus 300 can function as a two-way communication device, in which case a full infrared transceiver could be utilized to exchange signals between the media processor 106 and the viewing apparatus 300.

The viewing apparatus 300 can utilize a controller 307 to control operations thereof, and a portable power supply (not shown). The viewing apparatus 300 can have portions of a UI. For example, the viewing apparatus 300 can have a multi-purpose button 312 which can function as a power on/off button and as a channel selection button. A power on/off feature can be implemented by a long-duration depression of button 312 which can toggle from an on state to an off state and vice-versa. Fast depressions of button 312 can be used for channel navigation. Alternatively, two buttons can be added to the viewing apparatus 300 for up/down channel selection, which operate independent of the on/off power button 312. In another embodiment, a thumbwheel can be used for scrolling between channels.

The viewing apparatus 300 can also include an audio system 313 with one or more speakers in the extensions of the housing assembly such as shown by references 314, 316 to produce localized audio 318, 320 near a user's ears. Different portions of the housing assembly can be used to produce mono, stereo, or surround sound effects. Ear cups (not shown) such as those used in headphones can be used by the viewing apparatus 300 (as an accessory or integral component) for a more direct and low-noise audio presentation technique. The volume of sound presented by the speakers 314, 316 can be controlled by a thumbwheel 310 (or up/down buttons—not shown).

It would be evident from the above descriptions that many embodiments of the viewing apparatus 300 are possible, all of which are included in the present disclosure. In one embodiment, the viewing apparatus 300 can be utilized as part of the image capture process. For instance, the transceiver 302 can function to transmit a locator and/or calibration request that is wirelessly emitted for receipt by the camera(s) 175 or another processing device, such as the media processor 106. The emitted signal can be position information that is utilized to facilitate capturing images of a target, including adjusting the positioning and focus of the camera(s) 175 to capture the user and/or another object.

In one embodiment, the presentation device 202 can present holographic content that enables different perspectives of a user and/or object to be viewed depending on the position of the viewer. The holographic content can be all or a portion of the telepresence configuration 210, such as only the media content 250 or only one or more of the video content 225. As an example, the presentation device 202 can utilize active shuttering where different perspectives of an image are presented during different time slots which can be synchronized with the viewing apparatus 300. The particular perspective of an image can be viewed via the active shuttering of the viewing apparatus 300 based on the position of the viewer, such as detected from the viewing apparatus. An example of this is described in U.S. application Ser. No. 12/839,943 filed on Jul. 20, 2010, the disclosure of which is hereby incorporated by reference in its entirety. Other techniques and components can be utilized for presenting holographic content at the presentation device 202, including with or without a viewing apparatus 300.

In one embodiment, the images of the user in video content 225 can be modified, including change of clothing, environment and/or appearance. For example, the images of the other users can be presented but without the viewing apparatus 300 being worn. For instance, other images of the other users, such as in user profiles, can be utilized to modify the images to fill in pixels where the viewing apparatus 300 was removed. In another example, the modification of the images of the video content 225 can be based on the media content, such as the images of the other users being presented but wearing a cowboy hat where the media content is a cowboy movie. The modifications to the video content 225 can be based on a number of different factors, such as user preferences, and can be controlled by various entities, such as allowing a user to retain control over any modifications to the presentation of their own images and/or allowing a user to control any modification to the presentation of other users.
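The fill-in of pixels where the viewing apparatus was removed is essentially an image inpainting problem. As a sketch only, OpenCV's inpainting can reconstruct masked pixels from surrounding content; how the glasses mask is produced (e.g., by a detector or from the profile images mentioned above) is an assumption left open here.

```python
import cv2
import numpy as np

def remove_viewing_apparatus(frame: np.ndarray, glasses_mask: np.ndarray) -> np.ndarray:
    """frame: HxWx3 uint8 image; glasses_mask: HxW uint8, nonzero where the
    viewing apparatus appears. Returns the frame with those pixels filled
    in from neighboring image content."""
    return cv2.inpaint(frame, glasses_mask, 3, cv2.INPAINT_TELEA)
```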

FIG. 4 depicts an illustrative embodiment of a presentation device 402 with a polarized display. A display can be polarized with polarization filter technology so that alternating horizontal pixel rows can be made to have differing polarizations. For instance, odd horizontal pixel rows 402 can be polarized for viewing with one polarization filter, while even horizontal pixel rows 404 can be polarized for viewing with an alternative polarization filter. The viewing apparatus 300 previously described can be adapted to have one lens polarized for odd pixel rows, while the other lens is polarized for viewing even pixel rows. With polarized lenses, the viewing apparatus 300 can present a user with a 3D stereoscopic image. The telepresence configuration 210 of FIG. 2 can be presented utilizing the presentation device 402.

System 400 illustrates use of multiple cameras 175 for capturing images of user 420 from different perspectives or views. The different perspective images can then be utilized for generating a 3D representation of the user 420. The particular number and positioning of the cameras 175 can vary. In one embodiment, one of the cameras 175 can be a depth or distance camera that is utilized for generating a depth map associated with the user 420 so that the depth map and images captured by the other cameras can be used in constructing the 3D representation of the user 420.

FIG. 5 depicts an illustrative embodiment of a communication system 500 that can provide the telepresence configuration 210 to a plurality of locations 102, 502 and 503. While three locations are illustrated in system 500, the present disclosure can utilize two or more locations. The telepresence configuration 210 for each of the locations 102, 502 and 503 includes the media content 250 and includes video content 225 for the other users. For example, a user 520 at location 102 is provided with video content 225 that includes other users 525 at locations 502 and 503. The computing device 130 can be utilized to provide the telepresence configuration 210 to each of the locations 102, 502, 503, such as through receiving captured images of each of the users 520 and 525 and distributing the video content 225 and the media content 250 to each of the locations. As an example, each of the media processors 106 can then present the video content 225 and the media content 250, such as in the side-by-side window arrangement shown in FIG. 2. In one embodiment, the captured images and the media content 250 can be combined by the computing device 130 into single content that is provided to the locations 102, 502 and 503, such as through a multicast, without the need for further arranging the media and video content. In one embodiment, separate or a combined stream of the media content 250 and the video content(s) 225 can be provided to each media processor 106 for combining into the telepresence configuration 210.

In one embodiment, the media processor 106 can instruct the users 520 and 525 to sit or otherwise position themselves where they will be watching the telepresence configuration 210. A position of the user can then be determined for adjusting the camera 175. A distance to the viewer can be determined, such as through use of time-of-flight, stereo triangulation, sheet of light triangulation, structured light, interferometry, coded aperture, and so forth. Other components can also be utilized to facilitate the process, including a depth camera integrated with camera 175 or provided as a stand-alone component.
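For the time-of-flight option mentioned above, distance follows directly from the round-trip time of an emitted pulse; a short sketch, with the example timing value chosen purely for illustration:

```python
SPEED_OF_LIGHT_M_S = 299_792_458.0

def tof_distance_m(round_trip_seconds: float) -> float:
    """Distance is half the round trip multiplied by the propagation speed."""
    return SPEED_OF_LIGHT_M_S * round_trip_seconds / 2.0

# Example: a 20 ns round trip corresponds to about 3 m.
print(round(tof_distance_m(20e-9), 2))
```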

FIG. 6 depicts an illustrative embodiment of another communication system 600 that can present the telepresence configuration 210 at display devices 108 of different users at different locations via a telepresence communication session. System 600 can be overlaid or operably coupled with the devices and systems of FIGS. 1-5 to receive media content 250 and/or video content 225, which is presentable as 3D content. System 600 can include computing device 130 for receiving 2D media content from a media source 650 and for generating (or otherwise obtaining) a depth map associated with the media content, such as based on object segmentation. The computing device 130 can encode the media content and depth map (such as into a single video stream in H.264 format encapsulated in an MPEG-2 wrapper) and transmit the media content and depth map to one or more media processors 106, such as through broadcast, multicast and/or unicast utilizing network 625. In one embodiment, the computing device 130 can generate the depth map in real-time or near real-time upon receipt of the 2D media content, such as from a broadcast studio. The computing device 130 can also generate a depth map for video content that is captured by the cameras 175 in 2D.
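The disclosure does not specify how the frame and its depth map share one video stream; one plausible arrangement, shown here only as an assumed sketch, packs them side by side so an ordinary encoder (e.g., H.264) can carry both.

```python
import numpy as np

def pack_frame_and_depth(frame: np.ndarray, depth: np.ndarray) -> np.ndarray:
    """frame: HxWx3 uint8; depth: HxW uint8. Returns an Hx(2W)x3 frame,
    with the depth map replicated to three channels, suitable for
    encoding as ordinary video."""
    depth_rgb = np.repeat(depth[:, :, np.newaxis], 3, axis=2)
    return np.concatenate([frame, depth_rgb], axis=1)
```

The receiving media processor would split the packed frame back into image and depth halves before performing the 3D generation described below.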

System 600 includes media processors 106 which receive the video stream of the 2D media and video content and the corresponding depth maps. The media processors 106 can generate 3D content using the depth maps in real time upon receipt of the video stream. The media processors 106 can also detect the capability of display devices (such as through HDMI 1.4a) and can adjust the media content accordingly. For instance, if a display device 108 can only present 2D content, then the media processor 106 may discard the depth map and provide the 2D content to the display device. Otherwise, the media processor 106 can perform the real-time generation of the 3D content using the depth map and provide the content to the 3D capable display device 108. The conversion into 3D content from the depth map(s) can be based upon various imaging techniques and the 3D presentation in the telepresence configuration 210 can be based upon various formats including polarization, anaglyphics, active shuttering (such as alternate frame sequencing), autostereoscopy, and so forth.

In one embodiment, position information associated with one or more viewers can be utilized to adjust 3D media content, such as adjusting a convergence of the media content 250 and/or video content 225 based on a distance of the viewer(s) from the display device 108. Calibration can also be performed using a number of components and/or techniques, including a distance camera to measure distances and/or image camera 175 for capturing images of the viewers which can be used for interpolating distances.

System 600 has the flexibility to selectively provide 2D content and 3D content to different locations. System 600 further has the flexibility to selectively provide a combination of 2D and 3D content for presentation in the telepresence configuration 210 (FIG. 2). For example, a user may desire to watch the media content 250 in 3D while viewing the video content 225 in 2D. The selection of 2D or 3D presentation can be based on a number of factors, including device capability and type of content. The selection can be made by a number of different entities, including the users via the media processors 106 and/or by the service provider via computing device 130. The selection of 2D or 3D can also be made by one or more devices of system 600 without user intervention based on a number of factors, such as device capability, network status, viewing history, and so forth.

FIG. 7 depicts an illustrative embodiment of another communication system 700 that can present a telepresence configuration 710 at presentation devices 202 of different users at different locations 102, 502, 503 via a telepresence communication session. System 700 can be overlaid or operably coupled with the devices and systems of FIGS. 1-6 to receive media content and/or video content which is presentable as 3D or holographic content. System 700 can include components similar to that of system 600, such as the media processor 106, the presentation device 202, the computing device 130 and the cameras 175. The presentation device can be various types of display devices including televisions, holographic display devices, volumetric display devices, and so forth.

While three locations are illustrated in system 700, the present disclosure can utilize two or more locations. The telepresence configuration 710 for each of the locations 102, 502 and 503 can include object content 750 and can include video content 225 for the other users. For example, a user 520 at location 102 can be provided with video content 225 that includes other users 525 at locations 502 and 503. The computing device 130 can be utilized to provide the telepresence configuration 710 to each of the locations 102, 502, 503, such as through receiving captured images of each of the users 520 and 525 and distributing the video content 225 and the object content 750 to each of the locations. As an example, each of the media processors 106 can then present the video content 225 and the object content 750, such as in a side-by-side window arrangement that simulates the users 525 being present at the location 102, such as positioning the video content 225 as if the users 525 were viewing the object content 750. In one embodiment, the captured images of the users (e.g., video content 225) and the object content 750 can be combined by the computing device 130 into single content that is provided to the locations 102, 502 and 503, such as through a multicast, without the need for further arranging the object and video content. In one embodiment, separate or a combined stream of the object content 750 and the video content(s) 225 can be provided to each media processor 106 for combining into the telepresence configuration 710.

The object content 750 can be generated based on images captured by a camera system 725 that includes a group of cameras 175. The group of cameras 175 can be positioned to capture different viewing angles for an object 730. The images can then be processed into the object content 750 by generating 3D images from 2D images for the object 730 and/or capturing 3D images using 3D stereoscopic cameras. Various 3D techniques and components can be utilized, including polarization, anaglyphics, active shuttering (such as alternate frame sequencing), autostereoscopy, and so forth.

In one embodiment, the generated object content 750 is 3D content that is holographic content. The holographic content provides different viewing perspectives of the object 730 based on viewer position in reference to a display device. The object content 750 can be generated in whole or in part by various devices in system 700, such as computing device 130 and/or media processor 106. In one embodiment, the selection of a device to perform the generation of the object content 750 or a portion thereof can be based on load-balancing. For instance, local devices such as media processor 106 of location 503 can generate all or a portion of the object content 750 when a desired amount of processing resources are available for the local media processor 106. However, if a desired amount of processing resources are not available for the local media processor 106 at location 503, then other devices, such as one or more of the other media processors at locations 102 and 502 and the computing device 130, can generate the object content 750.

In one embodiment, a plurality of formats can be generated for the object content 750. The different formats can be based on the capabilities of the media processors 106 and/or the presentation devices 202. For instance, holographic content may be generated for the media processor 106 if it is determined that the presentation device 202 at location 102 is a holographic display device or otherwise has the ability to present holographic images, while 3D content based on active shuttering can be generated for the media processor 106 of location 502 if it is determined that capabilities at location 502 warrant this format. In one embodiment, the selection and generation of the format of the object content 750 can be based on capability determinations being made by the devices of system 700, such as the computing device 130 querying the local devices for display capabilities and/or accessing user profiles or past history information to make the determination. In one embodiment, each of the various formats can be generated without regard to device capabilities and a selection can then be made of the corresponding format to be transmitted.
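A minimal sketch of the capability-driven selection described above; the capability labels and preference ordering are illustrative assumptions, not values from the disclosure.

```python
FORMAT_PREFERENCE = ["holographic", "3d_active_shutter", "2d"]

def select_object_content_format(display_capabilities: set) -> str:
    """Pick the richest object-content format the presentation device supports."""
    for fmt in FORMAT_PREFERENCE:
        if fmt in display_capabilities:
            return fmt
    return "2d"

# Example: a location reporting only 2D and active-shutter 3D support.
print(select_object_content_format({"2d", "3d_active_shutter"}))  # 3d_active_shutter
```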

In one embodiment, the group of cameras 175 of camera system 725 can be arranged to surround the object 730, such as capturing or otherwise covering a 360 degree perspective of the object. This configuration can facilitate generating holographic content and/or generating 3D content that can be navigated. In one embodiment, the group of cameras 175 of camera system 725 can be arranged such that the plurality of different viewing angles of the images captures only a portion of 360 degrees of viewing perspective of the object 730. In one example, the computing device 130 and/or local devices (e.g., the media processor(s) 106) can generate additional images for a remaining portion of the 360 degrees of the viewing perspective of the object 730 based on the captured images. The additional images can then be utilized with the captured images to generate holographic content and/or to generate 3D content that can be navigated. In one example, the computing device 130 and/or local devices (e.g., the media processor(s) 106) can control the position of one or more of the cameras 175 to capture images for the remaining portion of the 360 degrees of the viewing perspective of the object 730. The additional captured images can then be utilized with the captured images to generate holographic content and/or to generate 3D content that can be navigated.

The cameras 175 of camera system 725 can be arranged in various configurations and there can be various numbers of cameras. For example, the cameras 175 can surround the object 730 in a circular configuration or can surround the object in a spherical configuration. As described above, the cameras 175 may only partially surround the object 730, and camera movement and/or image extrapolation can be performed to account for any viewing angles or portions of the object that are not covered by the particular camera configuration. As an example, image extrapolation or interpolation can be utilized that predicts or estimates unknown portions of the object 730 based on known portions of the object determined from one or more of the captured images. The object 730 of FIG. 7 is illustrated as a vase having a substantially uniform curved surface and curved rim. The captured images can be utilized to determine a radius of curvature of the rim and the shape of the outer surface of the vase of object 730 as shown in the captured images. These parameters can then be used in image extrapolation or interpolation to fill in the unknown portions of the image that were not captured in the images. Other techniques for determining unknown portions of the object 730 can also be used in the present disclosure.
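Before extrapolating images or repositioning cameras, the system would need to know which viewing angles are uncovered. The following is a sketch, under assumed camera azimuths and a uniform field of view, of finding the angular gaps that extrapolation or camera movement would have to fill.

```python
def find_coverage_gaps(camera_azimuths_deg: list, fov_deg: float = 60.0) -> list:
    """Return (start, end) azimuth ranges not covered by any camera.
    Assumes two or more cameras, each covering its azimuth +/- fov/2."""
    azimuths = sorted(a % 360.0 for a in camera_azimuths_deg)
    gaps = []
    for i, a in enumerate(azimuths):
        nxt = azimuths[(i + 1) % len(azimuths)]
        span = (nxt - a) % 360.0
        if span > fov_deg:  # angular separation exceeds one camera's view
            gaps.append(((a + fov_deg / 2.0) % 360.0,
                         (nxt - fov_deg / 2.0) % 360.0))
    return gaps

# Three cameras at 0, 90 and 180 degrees leave gaps, including the
# entire rear arc between 210 and 330 degrees.
print(find_coverage_gaps([0.0, 90.0, 180.0]))
```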

System 700 and camera system 725 allow various objects to be placed in front of the group of cameras so that 3D content or holographic content representative of the objects can be shared among viewers in a telepresence environment. For example, camera system 725 can define a target field or capture area 790 into which objects can be placed, such as object 730, so that the objects can be provided in the telepresence configuration 710. In one embodiment, one or more of the cameras 175 that define the target field 790 can be re-positioned to capture various perspectives of the object. The re-positioning of the cameras 175 can be performed in a number of different ways, such as pivoting cameras, sliding cameras on a track (e.g., a circular or annular track), and so forth. In one embodiment, the re-positioning of the cameras 175 can be performed automatically based on actuation of motors (e.g., electric servo-motors) coupled with the cameras that can adjust the position of the camera.

FIG. 8 depicts an illustrative embodiment of a method 800 operating in portions of the devices and systems described herein and/or illustrated in FIGS. 1-7. Method 800 can begin with step 802 in which media content 250 is obtained, such as through transmission over a network from a media source. The media content 250 can be various types from various sources. For example, the media content 250 can be movies that are broadcast or accessed on demand. In one embodiment, the media content 250 can be still images. In one embodiment, the media content 250 can be images of an object that can be manipulated, such as presenting images of a car that can be rotated. The media content 250 can be received as 2D content and converted to 3D content and/or can be received as 3D content. The media content 250 can be received by the computing device 130 (e.g., a centralized distribution process) and/or received by one or more of the media processors 106 (e.g., a distributed or master-slave process). It should be understood that the present disclosure can include the media processor 106 being various types of devices, including personal computers, set top boxes, smart phones and so forth.

At step 804, video content 225 can be received from a plurality of different media receivers 106 at different locations. The video content 225 can be received as part of a communication session established between media processors 106 of each of the different users. Each of the video content 225 can be received as 2D content and converted to 3D content and/or can be received as 3D content. Each of the video content 225 can be received by the computing device 130 (e.g., a centralized distribution process) and/or received by one or more of the media processors 106 (e.g., a distributed or master-slave process). The video content 225 can be captured by one or more cameras 175 at each location, where the cameras are 2D and/or 3D cameras. Other components can also be used to facilitate capturing the video content 225, including lighting components and/or audio components, which can be controlled locally and/or remotely (e.g., by the computing device 130 or a master media processor 106).

At step 806, it can be determined if 3D content has been requested or is otherwise desired. For instance, a user profile associated with each user at each location can be accessed by the computing device 130 and/or one or more of the media processors 106 to determine if 3D content is desired for the media content 250 and/or video content 225. If 3D content is desired, then at step 808 the content can be processed accordingly. For example, if the content received is in 3D format then a determination can be made if the format is compatible with the media processors 106 and adjusted accordingly. For instance, content can be adjusted to be compatible with a first media processor 106 and a copy of the content can be further adjusted to be compatible with a second media processor. If the content is in 2D format then the content can be converted to 3D format, such as through use of a depth map or using other techniques.

At step 809, images can be captured of the object 730 using the camera system 725. The images can capture a plurality of different viewing angles or perspectives of the object 730 so that the object content 750 can be generated such that the object is presented as 3D content. In one embodiment, the 3D content can be holographic content. The object content 750 can be generated based on captured 2D and/or 3D images, including 3D images captured by a plurality of stereoscopic cameras 175 of camera system 725. The camera system 725 can be controlled locally and/or controlled remotely, such as by the computing device 130. The control over the camera system 725 can include re-positioning of the cameras 175, as well as other adjustable features, including resolution, speed, and so forth.

In one embodiment, one or more locations (e.g., locations 102, 502 and 503) can include the camera system 725 so that a user at the particular location can virtually share any objects (e.g., object 730) with other users through use of the camera system 725.

In one embodiment, the user 525 associated with the camera system 725 can be a merchant or other entity providing goods or services. For example, the object 730 can be a product being sold by the merchant. The location 503 can be a sales facility associated with the user or can be a location that is being utilized by the user 525 to sell his or her product (e.g., object 730). In one example, the merchant can be charged for selling the product by a service provider operating portions of the system 700, such as the computing device 130 and/or the camera system 725. In one example, revenue that is generated as a result of presentation of the object content 750 can be shared between the merchant and the service provider. Other fee sharing arrangements with merchants for utilization of the camera system 725 can also be used by the present disclosure.

At step 810, the media content 250 and/or the object content 750, along with the video content 225, can be presented at each display device of each location in a telepresence configuration, such as configuration 210 of FIG. 2 and/or configuration 710 of FIG. 7. The telepresence configuration can simulate each of the users being co-located at each location. In one embodiment, the telepresence configurations can be adjustable, such as by the user selecting the configuration. The adjustments to the telepresence configuration can include positioning of the video content, size, resolution, and so forth.

In one embodiment at step 812, the computing device 130 and/or the media processor 106 can monitor to detect speech of a user at one of the locations. If speech is detected from a target user, then at step 814 the video content can be adjusted (e.g., by the computing device 130 and/or the media processor 106) to further simulate the target user speaking to the other users. This simulation can include depicting the target user or a portion thereof (e.g., the user's head) turning to face the viewer of the display device to speak with them. In one embodiment, the telepresence configuration can provide images of the rear of the other users' heads as if they were watching the media content, and then present the face of the target user when the target user is speaking. In one embodiment, images of a front of a user's head can be used to generate video content depicting the back of the user's head, such as through determining shape, circumference, hair color and so forth.
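A sketch of the speech-triggered adjustment at steps 812-814: a short-term energy detector flags the speaking user and the renderer is handed a hint to rotate that user's image. The threshold and the hint structure are illustrative assumptions.

```python
import numpy as np

def detect_speech(pcm_frame: np.ndarray, threshold: float = 0.02) -> bool:
    """pcm_frame: mono float samples in [-1, 1]. True if the frame's RMS
    energy suggests active speech."""
    return float(np.sqrt(np.mean(pcm_frame ** 2))) > threshold

def adjust_video_content(user_id: str, speaking: bool) -> dict:
    """Return a rendering hint for the telepresence configuration."""
    return {"user": user_id, "rotate_toward_viewer": speaking}
```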

In one embodiment at step 816, user interaction with the object content 750 can be detected or otherwise determined, such as by one of the media processors 106 and/or the computing device 130. The user interaction can be based on user inputs at a user interface at one of the locations 102, 502, 503. At step 818, the object content 750 can be adjusted in response to the user interaction. The adjustment can be performed in a number of different ways, including based on utilizing different images with different viewing angles, adjusting the cameras 175 of camera system 725 to provide for different perspectives, and/or extrapolating views based on the captured images.

In one embodiment, the user interaction can be based on movement of the user, such as movement of the user's hand towards the presented object 730. As an example, the user 520 at location 102 can be viewing a 3D or holographic representation of the object 730 and can move his or her hand so as to gesture rotating the object 730. In the telepresence configurations of locations 502 and 503, the gestures of the user 520 can be viewed as the hand of the user rotating the object 730 due to the positioning of the object content 750 and the video content 225 in the telepresence configuration 710. The interaction is not limited to moving the object 730, and can include other interaction, such as removing a portion of the object to present a different view.
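One simple way the gesture example above could map to an adjustment of the object content is to convert horizontal hand displacement into a rotation angle; the scale factor here is an assumed tuning parameter.

```python
def gesture_to_rotation_deg(hand_dx_px: float, pixels_per_degree: float = 4.0) -> float:
    """Convert horizontal hand displacement (pixels) into a rotation of the
    presented object about its vertical axis (degrees)."""
    return hand_dx_px / pixels_per_degree

# Example: a 120-pixel sweep rotates the object 30 degrees.
print(gesture_to_rotation_deg(120.0))  # 30.0
```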

In one embodiment at step 820, system 700 can provide telepresence messaging between users. For instance, user 520 at location 102 can send a message to user 525 at location 502. The message can be input by the user 520 via text, speech, and/or selection of pre-determined messages. The message can be presented in the telepresence configuration 710 at presentation device 202 of location 503 via 3D or holographic text. The message can be presented in combination with, or in place of, the media content 250 and/or the object content 750. In one embodiment, the sender of the message can select the recipient(s) of the message so that only select users can see the message even though other users may be participating in the communication session. In one embodiment, the message can be sent in conjunction with a social network and/or messaging service, including Facebook, Twitter and so forth.

FIG. 9 depicts an illustrative embodiment of another communication system 900 that can present a telepresence configuration 910 at presentation devices 108 of different users at different locations 902, 903, 904 via a telepresence communication session. System 900 can be overlaid or operably coupled with the devices, systems and methods of FIGS. 1-8 to receive media content and/or video content in the telepresence configuration 910. The media content and/or video content (e.g., media content 250 and video content 225 of FIG. 2) can be presented as 3D or holographic content. System 900 can include components similar to that of system 600, such as the media processor 106, the computing device 130 and the cameras 175. The presentation devices 108 can be various types of display devices, including televisions, holographic display devices, volumetric display devices, and so forth. The cameras 175 can be various types of devices, including 2D and 3D cameras, and can be any number and configuration of cameras, including system 725 of FIG. 7. While three locations are illustrated in system 900, the present disclosure can utilize two or more locations.

The media processors 106 of the locations 902, 903, 904 can communicate with each other and/or with the computing device 130 over a network 950 that includes network elements 955. The network elements 955 can be various devices utilized for providing communication, including routers, switches, servers, DSLAMs, and so forth. The number and configuration of the network elements 955 can vary. A multimedia source 960 can be utilized for sourcing the media content to the media processors 106 of the locations 902, 903, 904, such as via the computing device 130, although other sources can also be used by the present disclosure, including local sources, such as a DVR at one of the locations 902, 903, 904.

System 900 can provide for latency testing to be performed with respect to the media processors 106 of the locations 902, 903, 904, as well as with respect to the network elements 955 that could be used for providing the telepresence sessions between these locations. The type of latency testing performed can vary. For example, loopback testing can be performed by the computing device 130 to each of the media processors 106 of the locations 902, 903, 904. The loopback testing can also be originated from devices other than the computing device 130, such as from the media processors 106 of one or more of the locations 902, 903, 904 and/or from one or more network elements 955, such as along a potential route of the telepresence session. In one embodiment, multiple loopback tests can be originated from multiple devices along potential routes of the telepresence session. The results of this group of loopback tests can be utilized to isolate particular network elements 955 that are experiencing latency.
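
A simplified way to use such a group of loopback tests for isolation is to difference the round-trip times measured to successive hops along a candidate route: a large jump between adjacent hops implicates the newly added element. In this Python sketch, probe_rtt_ms is a hypothetical placeholder for whatever loopback mechanism the network elements support, and the segment budget is an invented tolerance.

    def isolate_latent_elements(route, probe_rtt_ms, segment_budget_ms=5.0):
        """Given an ordered route of element IDs and a loopback probe,
        flag elements whose incremental round-trip cost exceeds the budget."""
        latent = []
        previous_rtt = 0.0
        for element in route:
            rtt = probe_rtt_ms(element)  # hypothetical loopback test to this hop
            if rtt - previous_rtt > segment_budget_ms:
                latent.append(element)
            previous_rtt = rtt
        return latent

    # Example with canned measurements standing in for real loopback tests:
    measured = {"955A": 4.0, "955B": 22.0, "955C": 25.0}
    print(isolate_latent_elements(["955A", "955B", "955C"], measured.__getitem__))
    # -> ['955B']  (the segment ending at 955B added roughly 18 ms)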

Other latency testing techniques can also be utilized by the present disclosure for isolating network elements 955 experiencing latency, including periodically or otherwise gathering latency parameters associated with all or a portion of the network elements 955 of the network 950. The latency parameters can be analyzed for determining particular network elements 955 experiencing latency. It should be understood that the latency can be caused by various factors, including workload, faults, on-going maintenance, and so forth. In one or more embodiments, the methodology and/or the components used to determine which network elements 955 of the network 950 are experiencing latency can be selected based on a known or predicted cause of the latency.

In one embodiment, when one or more network elements 955 of the network 950 are determined to be experiencing latency, a latency area 980 can be determined or otherwise defined for the network 950. The latency area 980 can be determined based on the isolated network elements 955 experiencing the latency, as well as a known topology of the network 950. For example, the latency area 980 of FIG. 9 depicts three network elements 955A, 955B, 955C. In this example, network elements 955A and 955C have been determined to be experiencing latency, while no such determination has been made with respect to network element 955B. However, the network element 955B has been included in the latency area 980 because, based on the network topology, it has been determined that routing 985 between network element 955A and network element 955C would be done through network element 955B.
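
The inclusion of the non-latent element 955B can be reproduced with a small graph computation: take the isolated latent elements and add every element that lies on a route between two of them. A minimal sketch, assuming the topology is available as an adjacency map (the map below is invented for illustration):

    from itertools import combinations
    from collections import deque

    def shortest_path(graph, start, goal):
        """Breadth-first shortest path over an adjacency map, or None."""
        queue, seen = deque([[start]]), {start}
        while queue:
            path = queue.popleft()
            if path[-1] == goal:
                return path
            for nxt in graph.get(path[-1], ()):
                if nxt not in seen:
                    seen.add(nxt)
                    queue.append(path + [nxt])
        return None

    def latency_area(graph, latent_elements):
        """Latent elements plus every element on a path between two of them."""
        area = set(latent_elements)
        for a, b in combinations(latent_elements, 2):
            path = shortest_path(graph, a, b)
            if path:
                area.update(path)
        return area

    topology = {"955A": ["955B"], "955B": ["955A", "955C"], "955C": ["955B"]}
    print(latency_area(topology, ["955A", "955C"]))  # includes '955B'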

In one or more embodiments, the routes for the telepresence session can be configured or otherwise determined based on the latency area 980. For example, routes can be configured to avoid all or a portion of the network elements 955 in the latency area 980. The configuration of the routes can be performed by a number of different devices (e.g., the computing device 130) and can be performed in a centralized or distributed fashion (e.g., using a group of computing devices 130 positioned in different parts of the network 950).
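
Route configuration that avoids the latency area can then be treated as a constrained path search: prune the area's elements from the graph before searching. A sketch under that assumption; the node names (MP-903, CD-130, 955C/D/E) echo the figure but the topology itself is invented.

    from collections import deque

    def route_avoiding(topology, start, goal, excluded):
        """Shortest path (BFS) that never enters an excluded latency-area element."""
        queue, seen = deque([[start]]), {start}
        while queue:
            path = queue.popleft()
            if path[-1] == goal:
                return path
            for nxt in topology.get(path[-1], ()):
                if nxt not in seen and nxt not in excluded:
                    seen.add(nxt)
                    queue.append(path + [nxt])
        return None

    topology = {
        "MP-903": ["955C", "955D"],
        "955C": ["MP-903", "CD-130"],
        "955D": ["MP-903", "955E"],
        "955E": ["955D", "CD-130"],
        "CD-130": ["955C", "955E"],
    }
    print(route_avoiding(topology, "MP-903", "CD-130", {"955C"}))
    # -> ['MP-903', '955D', '955E', 'CD-130'], mirroring the re-routing of route 970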

In one or more embodiments, dedicated routes can be utilized for the telepresence sessions. For example, heavy users of telepresence sessions and/or users that have obtained a service upgrade may be provided with dedicated routes using select network elements 955 that are intended to reduce latency in the transmission and/or receipt of the telepresence session signals. In one or more embodiments, the select network elements 955 of the dedicated routes can be dedicated devices that are used only for telepresence sessions and/or for limited functions that include telepresence sessions. In one or more embodiments, the select network elements 955 of the dedicated routes can be devices (dedicated devices and/or non-dedicated devices) that are known to have lower latency, such as due to lower workloads, higher processing resources, and so forth.

Continuing with the example set forth in system 900, one dedicated route 970 is illustrated between media processor 106 of location 903 and the computing device 130. This example illustrates the locations 902 and 904 utilizing non-dedicated routes through the network 950. The number and configuration of dedicated routes can vary, including providing all or only a portion of the media processors 106 of the locations 902, 903, 904 with dedicated routes to and from the computing device 130. Other dedicated routes can also be utilized, such as where data is being exchanged with other devices, including routes directly between media processors 106 of the locations 902, 903, 904 without routing to the computing device 130.

In one or more embodiments, the latency area 980 can be utilized for reconfiguring the dedicated route 970. For instance, the dedicated route 970 might normally include network element 955C. But, since network element 955C has been determined to be part of latency area 980, the dedicated route 970 can be re-configured to avoid use of network element 955C through re-routing to network element 955D and to network element 955E.

System 900 also provides for injecting delay into the presentation of one or more of the telepresence configurations 910, including portions of the telepresence configuration, such as the media content. As an example, a determination can be made as to which of the locations 902, 903, 904 is experiencing the largest latency in presentation of the media content and/or the video content in the telepresence configuration. One or more delay time periods can be determined based on this latency, and a delay(s) can be injected into presentation of the telepresence configuration for the other locations. The delay(s) can be applied to both the media content and the video content or can be separately applied, including use of different delay periods for the media content and the video content. By delaying the presentation of the other devices by the delay time(s) associated with the location experiencing the most latency, system 900 can provide a synchronized presentation of the telepresence configuration. The delay(s) can be injected by the computing device 130, such as by delaying delivery of the media content to locations 902, 903 when location 904 is experiencing the largest latency for the media content. The delay(s) can also be injected by the media processors 106 at select locations, including based on a delay period calculated by the computing device 130 for the other location and transmitted to the media processors, when presenting the telepresence configuration 910 at the display devices 108 of the select locations.

FIG. 10 depicts an illustrative embodiment of a method 1000 operating in portions of the devices, systems and/or methods described herein and/or illustrated in FIGS. 1-9. Method 1000 can begin with step 1002 in which a request for a telepresence session is received. The request can be received at various devices, depending on how the telepresence session is being managed. For example, in system 900 of FIG. 9, the telepresence session request can be received by computing device 130 from one or more of the media processors 106 at locations 902, 903, 904. In one embodiment, the telepresence session request can be received in conjunction with a social network application.

In step 1004, latency testing can be performed for network elements that could potentially deliver signals for the telepresence session (i.e., the element is part of a possible route for the telepresence session). The type of latency testing can vary and can include loopback testing, such as from the computing device 130 to each of the media processors 106 at locations 902, 903, 904. The latency testing can be performed between other devices of the network, including between network elements, in order to isolate select network elements that are experiencing latency issues. The latency testing can be performed at various times. For example, latency testing can be performed in response to receiving the request for the telepresence session and/or can be performed at other times, such as periodically. Other types of latency testing can also be included in the present disclosure, including gathering packet latency telemetry from all or a portion of the network elements.

In step 1006, a latency area can be detected or otherwise determined based on the results of the latency testing. The latency area can include network elements experiencing latency issues. The latency area can further include other network elements that have not been determined to be experiencing latency issues but that, due to their proximity to those network elements, are included in the latency area. In step 1008, routes for the telepresence session can be configured based on the latency areas. For example, routes can be configured to avoid all or a portion of the network elements in the latency area.

In step 1010, it can be determined whether any of the users, such as at locations 902, 903, 904, have dedicated routes. For example, a user may have a service plan that includes dedicated routes for telepresence sessions. In one or more embodiments, service plan upgrades can be offered in response to a request for a telepresence session. In one or more embodiments, usage of telepresence sessions by a user can be monitored to generate a history for the user. The history can be compared to a usage threshold to determine if a dedicated route should be provided to the user for the telepresence session.
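
The history-versus-threshold check of step 1010 reduces to a simple comparison: an entitlement from the service plan, or recent usage exceeding a threshold. A minimal sketch; the dictionary keys and the threshold of 10 sessions per period are invented for illustration.

    def dedicated_route_allowed(user):
        """Step 1010 sketch: a dedicated route is granted via a service-plan
        entitlement or by exceeding a usage threshold from monitored history."""
        USAGE_THRESHOLD = 10  # assumed sessions per billing period
        return (user.get("plan_includes_dedicated", False)
                or user.get("sessions_this_period", 0) >= USAGE_THRESHOLD)

    print(dedicated_route_allowed({"sessions_this_period": 14}))  # -> True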

If the user is permitted to utilize a dedicated route, then in step 1012 the route can be re-configured based on the dedicated route. It should be further understood that the sequence of the steps of method 1000 can be changed. For example, dedicated routes can first be determined and then the dedicated routes can be altered when the dedicated route passes through, or otherwise relies upon, a network element of the latency area. If, on the other hand, there are no dedicated routes, then method 1000 proceeds to step 1014 to determine if latency issues still exist.

If there are no latency issues remaining, or if the latency issues are within acceptable tolerances, then in step 1018 the telepresence session can be provided. If, on the other hand, there are latency issues outside of acceptable tolerances, then in step 1016 a delay can be injected into the presentation of the telepresence configuration at a portion of the locations. For example, a determination can be made as to which location is experiencing the greatest latency, and a delay period can be calculated based on that latency. A local delay can be injected, such as the other media processors 106 at the other locations delaying presentation of the media content and/or the video content to synchronize the telepresence configurations at each location. The present disclosure can also use a remote delay, such as the computing device 130 delaying providing the media content and/or the video content to a portion of the locations based on a calculated delay period.

In one embodiment, the delay period can be based on a difference in latency between the different locations. For example, a first location may present an image at a 20 ms relative mark, while a second location presents the same image at a 40 ms relative mark and a third location presents the same image at a 60 ms relative mark. A first delay period can be calculated for the first location to be 40 ms based on the delay difference between the first and third locations. A second delay period can be calculated for the second location to be 20 ms based on the delay difference between the second and third locations. No delay would be provided to the third location in this example. The method 1000 can proceed to step 1018 to provide the telepresence session.
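
The arithmetic of this example generalizes to: the delay for a location equals the latency of the slowest location minus that location's latency. A minimal sketch reproducing the 20/40/60 ms figures; the location labels are illustrative only.

    def delay_periods(presentation_marks_ms):
        """Delay each location by its gap to the slowest location's mark."""
        slowest = max(presentation_marks_ms.values())
        return {loc: slowest - mark for loc, mark in presentation_marks_ms.items()}

    marks = {"first": 20, "second": 40, "third": 60}
    print(delay_periods(marks))  # -> {'first': 40, 'second': 20, 'third': 0}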

Upon reviewing the aforementioned embodiments, it would be evident to an artisan with ordinary skill in the art that said embodiments can be modified, reduced, or enhanced without departing from the scope and spirit of the claims described below. The embodiments described above can be adapted to operate with any device capable of performing in whole or in part the steps described for methods 800 and 1000.

In one embodiment, the latency testing, the determination of the latency area, the configuration of the routes and/or the injection of the delay can be performed at various times, including during the telepresence session. For example, testing can be periodically performed during the telepresence session to determine if there has been a change to the latency area and/or a change to a delay period to be injected into the presentation of the telepresence configuration by one or more of the media processors. If a change is detected, then corresponding corrections can be made, such as re-configuring routes and/or changing the delay time period.

In one embodiment, configuring routes based on latency areas and injecting a delay into the presentation of the telepresence configuration at a portion of the locations can be selectively applied based on thresholds. For example, injecting delay into the presentation of the telepresence configuration for a portion of the locations can be utilized without re-configuring routes based on latency areas when a first latency threshold has not been satisfied. However, when the first latency threshold is satisfied (e.g., latency time periods exceeding a pre-determined amount), then both techniques may be applied to synchronize the telepresence configurations at each of the locations.
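
This threshold logic can be read as a two-tier policy: below the first latency threshold, inject delay only; at or above it, also re-configure routes. A sketch with an invented threshold value and invented step names:

    FIRST_LATENCY_THRESHOLD_MS = 50.0  # assumed "pre-determined amount"

    def choose_mitigations(worst_latency_ms):
        """Return the mitigation steps to apply for the measured worst latency."""
        steps = ["inject_presentation_delay"]
        if worst_latency_ms >= FIRST_LATENCY_THRESHOLD_MS:
            steps.append("reconfigure_routes_around_latency_area")
        return steps

    print(choose_mitigations(35.0))  # delay only
    print(choose_mitigations(80.0))  # delay plus re-routing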

In one embodiment, the delay can be applied to video alone, to audio alone, and/or to both video and audio. In one embodiment, the delay period can be calculated for the video portion of the content, and the audio portion can be synchronized with the video portion.

In one embodiment, a combination of media content 250 and object content 750 can be presented in the telepresence configurations. For example, a merchant can present the object content 750 for a product being sold (images of which are captured by the camera system 725) while presenting the media content 250 that is an infomercial describing the product.

In one embodiment, the device(s) that perform the functions described herein can be selected based on capability. For example, if all media processors 106 have the ability to generate 3D video content, then a distributed process can be utilized that does not utilize the computing device 130. If only a portion of the media processors 106 have the ability to generate 3D content, then a master-slave arrangement can be established between the media processors 106 without the need to utilize the computing device 130. If none of the media processors 106 have the ability to generate 3D content, then the computing device 130 can be utilized for generating 3D content. Similarly, 2D images captured by a 2D camera can be transmitted to a device capable of generating 3D video content, such as the computing device 130 and/or another media processor 106. In one embodiment, the selection of the device(s) can be based on other factors, including processing resources, workload, type of content and so forth. For example, if only one media processor 106 has the capability to generate 3D content, then the computing device 130 may be utilized alone or in conjunction with the select media processor for generating the 3D content.
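
The capability-driven selection reads as a cascade over how many media processors can render 3D content. A minimal sketch, assuming a per-device boolean capability flag (the device names and returned arrangement labels are invented):

    def select_3d_renderers(media_processors, computing_device="computing_device_130"):
        """Pick the device arrangement for 3D generation based on capability."""
        capable = [mp for mp, can_render_3d in media_processors.items() if can_render_3d]
        if len(capable) == len(media_processors):
            return {"mode": "distributed", "devices": capable}
        if len(capable) > 1:
            return {"mode": "master-slave", "master": capable[0], "slaves": capable[1:]}
        if len(capable) == 1:
            return {"mode": "assisted", "devices": [computing_device, capable[0]]}
        return {"mode": "centralized", "devices": [computing_device]}

    print(select_3d_renderers({"mp_902": True, "mp_903": False, "mp_904": False}))
    # -> the computing device assists the single capable media processor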

In one embodiment, the selection of the media content can be performed in conjunction with a negotiation process amongst at least a portion of the users that are intended to receive the telepresence configuration. For example, the users can vote on the media content to be presented. In another embodiment, priority can be provided to particular users for the negotiating process, such as priority based on device capability. As another example, past voting history can be used as a factor in the selection of the media content, such as weighting votes more heavily when the user has been unsuccessful in voting to select media content in the past.
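
The vote-weighting idea can be sketched as counting ballots with a per-user weight that grows with past unsuccessful votes. The 0.25 increment per prior loss is an invented parameter, not taken from the disclosure.

    from collections import defaultdict

    def tally_media_votes(ballots, past_losses):
        """ballots: user -> chosen title; past_losses: user -> prior failed votes.
        Each past loss adds an assumed 0.25 to that user's vote weight."""
        totals = defaultdict(float)
        for user, title in ballots.items():
            totals[title] += 1.0 + 0.25 * past_losses.get(user, 0)
        return max(totals, key=totals.get)

    ballots = {"alice": "Movie A", "bob": "Movie B", "carol": "Movie B"}
    print(tally_media_votes(ballots, {"alice": 5}))  # Alice's 2.25 beats 2.0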

In one embodiment, the selection of the media content can be based on factors associated with one of the users. For example, the other users may desire to wish a happy birthday to a target user. A telepresence session can be established with the target user and the other users in which the media content is a particular singer singing a birthday song to the target user. The selection of the singer can be based on a preference of the target user, including the target user's monitored song-consumption history.

In one embodiment, the providing of the telepresence configuration can be done in conjunction with a social network application. For example, each of the users can be members of the social network, and the establishing of the communication session between the different users can be initiated based on selections made from the social network application.

In one embodiment, the presentation of the telepresence configuration by a media processor 106 can be done at multiple display devices. For example, in a system that has three display devices positioned adjacent to each other, the media processor 106 can provide a middle display device with the media content for presentation while providing the end display devices with the video content from each of the other users to simulate the other users being co-located at the location of the media processor 106.

Other suitable modifications can be applied to the present disclosure without departing from the scope of the claims below. Accordingly, the reader is directed to the claims section for a fuller understanding of the breadth and scope of the present disclosure.

FIG. 11 depicts an exemplary diagrammatic representation of a machine in the form of a computer system 1100 within which a set of instructions, when executed, may cause the machine to perform any one or more of the methodologies discussed above. In some embodiments, the machine operates as a standalone device. In some embodiments, the machine may be connected (e.g., using a network) to other machines. In a networked deployment, the machine may operate in the capacity of a server or a client user machine in a server-client user network environment, or as a peer machine in a peer-to-peer (or distributed) network environment.

The machine may comprise a server computer, a client user computer, a personal computer (PC), a tablet PC, a laptop computer, a desktop computer, a control system, a network router, switch or bridge, or any machine capable of executing a set of instructions (sequential or otherwise) that specify actions to be taken by that machine. It will be understood that a device of the present disclosure includes broadly any electronic device that provides voice, video or data communication. Further, while a single machine is illustrated, the term “machine” shall also be taken to include any collection of machines that individually or jointly execute a set (or multiple sets) of instructions to perform any one or more of the methodologies discussed herein.

The computer system 1100 may include a processor or controller 1102 (e.g., a central processing unit (CPU), a graphics processing unit (GPU), or both), a main memory 1104 and a static memory 1106, which communicate with each other via a bus 1108. The computer system 1100 may further include a video display unit 1110 (e.g., a liquid crystal display (LCD), a flat panel, or a solid state display). The computer system 1100 may include an input device 1112 (e.g., a keyboard), a cursor control device 1114 (e.g., a mouse), a disk drive unit 1116, a signal generation device 1118 (e.g., a speaker or remote control) and a network interface device 1120. The devices of computer system 1100 can be found in the previously shown figures, such as computing device 130, camera system 725, camera 175, media processor 106, TV 202 and so forth.

The disk drive unit 1116 may include a machine-readable medium 1122 on which is stored one or more sets of instructions (e.g., software 1124) embodying any one or more of the methodologies or functions described herein, including those methods illustrated above. The instructions 1124 may also reside, completely or at least partially, within the main memory 1104, the static memory 1106, and/or within the processor or controller 1102 during execution thereof by the computer system 1100. The main memory 1104 and the processor 1102 also may constitute machine-readable media. The instructions 1124 can include one or more of the steps described above, including calibration steps, such as determining or interpolating viewer distance, determining convergence from viewer distance, and so forth.

Dedicated hardware implementations including, but not limited to, application specific integrated circuits, programmable logic arrays and other hardware devices can likewise be constructed to implement the methods described herein. Applications that may include the apparatus and systems of various embodiments broadly include a variety of electronic and computer systems. Some embodiments implement functions in two or more specific interconnected hardware modules or devices with related control and data signals communicated between and through the modules, or as portions of an application-specific integrated circuit. Thus, the example system is applicable to software, firmware, and hardware implementations.

In accordance with various embodiments of the present disclosure, the methods described herein are intended for operation as software programs running on a computer processor. Furthermore, software implementations, including, but not limited to, distributed processing or component/object distributed processing, parallel processing, or virtual machine processing, can also be constructed to implement the methods described herein.

The present disclosure can include a machine-readable medium containing instructions 1124, or that which receives and executes instructions 1124 from a propagated signal, so that a device connected to a network environment 1126 can send or receive voice, video or data, and can communicate over the network 1126 using the instructions 1124. The instructions 1124 may further be transmitted or received over a network 1126 via the network interface device 1120.

While the machine-readable medium 1122 is shown in an example embodiment to be a single medium, the term “machine-readable medium” should be taken to include a single medium or multiple media (e.g., a centralized or distributed database, and/or associated caches and servers) that store the one or more sets of instructions. The term “machine-readable medium” shall also be taken to include any medium that is capable of storing or encoding a set of instructions for execution by the machine and that cause the machine to perform any one or more of the methodologies of the present disclosure.

The term “machine-readable medium” shall accordingly be taken to include, but not be limited to: solid-state memories such as a memory card or other package that houses one or more read-only (non-volatile) memories, random access memories, or other re-writable (volatile) memories; and magneto-optical or optical media such as a disk or tape. Accordingly, the disclosure is considered to include any one or more of a machine-readable medium, as listed herein and including art-recognized equivalents and successor media, in which the software implementations herein are stored.

Although the present specification describes components and functions implemented in the embodiments with reference to particular standards and protocols, the disclosure is not limited to such standards and protocols. Each of the standards for Internet and other packet-switched network transmission (e.g., TCP/IP, UDP/IP, HTML, HTTP), as well as the examples for calibration, distance determination, communication protocols, and so forth, represent examples of the state of the art. Such standards are periodically superseded by faster or more efficient equivalents having essentially the same functions. Accordingly, replacement standards and protocols having the same functions are considered equivalents.

The illustrations of embodiments described herein are intended to provide a general understanding of the structure of various embodiments, and they are not intended to serve as a complete description of all the elements and features of apparatus and systems that might make use of the structures described herein. Many other embodiments will be apparent to those of skill in the art upon reviewing the above description. Other embodiments may be utilized and derived therefrom, such that structural and logical substitutions and changes may be made without departing from the scope of this disclosure. Figures are also merely representational and may not be drawn to scale. Certain proportions thereof may be exaggerated, while others may be minimized. Accordingly, the specification and drawings are to be regarded in an illustrative rather than a restrictive sense.

Such embodiments of the inventive subject matter may be referred to herein, individually and/or collectively, by the term “invention” merely for convenience and without intending to voluntarily limit the scope of this application to any single invention or inventive concept if more than one is in fact disclosed. Thus, although specific embodiments have been illustrated and described herein, it should be appreciated that any arrangement calculated to achieve the same purpose may be substituted for the specific embodiments shown. This disclosure is intended to cover any and all adaptations or variations of various embodiments. Combinations of the above embodiments, and other embodiments not specifically described herein, can be used by the present disclosure.

The Abstract of the Disclosure is provided with the understanding that it will not be used to interpret or limit the scope or meaning of the claims. In addition, in the foregoing Detailed Description, it can be seen that various features are grouped together in a single embodiment for the purpose of streamlining the disclosure. This method of disclosure is not to be interpreted as reflecting an intention that the claimed embodiments require more features than are expressly recited in each claim. Rather, as the following claims reflect, inventive subject matter lies in less than all features of a single disclosed embodiment. Thus, the following claims are hereby incorporated into the Detailed Description, with each claim standing on its own as separately claimed subject matter.

What is claimed is:
 1. A method comprising: receiving, by a processing system including a processor, a first image of a first user, a second image of a second user, and a plurality of images of an object, wherein the plurality of images of the object comprise a third image of the object that had been captured from a first camera angle, and a fourth image of the object that had been captured from a second camera angle, wherein the first camera angle is different from the second camera angle, and wherein the first user and the second user participate in a conference call; generating, by the processing system, a fifth image of the object, wherein the fifth image includes a portion of the object not included in the third and fourth images and wherein the fifth image is generated based upon one of: the third image, the fourth image, and a combination of the third image and the fourth image; generating, by the processing system, first joint media content of the conference call that comprises the second image of the second user and a given one of the third and fourth images of the object; generating, by the processing system, second joint media content of the conference call that comprises the first image of the first user and the given one of the third and fourth images of the object; providing, by the processing system, the first joint media content to a first display associated with the first user to cause presentation of the first joint media content on the first display; providing, by the processing system, the second joint media content to a second display associated with the second user to cause presentation of the second joint media content on the second display; detecting, by the processing system, gesture input from the first user directed towards the first joint media content presented on the first display; and adjusting, by the processing system, responsive to detecting of the gesture input, the second joint media content resulting in adjusted second joint media content, wherein the adjusted second joint media content includes, instead of the given one of the third and fourth images of the object in the second joint media content, the fifth image of the object.
 2. The method of claim 1, wherein the detecting of the gesture input further comprises detecting, by the processing system, a gesture simulating rotating the object of the first joint media content, and wherein inclusion of the fifth image in the adjusted second joint media content simulates rotation of the object.
 3. The method of claim 1, wherein the adjusting of the second joint media content further comprises adjusting, by the processing system, the first image of the first user.
 4. The method of claim 3, wherein the adjusting of the first image of the first user of the second joint media content further comprises adjusting, by the processing system, the first image of the first user turned towards the object.
 5. The method of claim 1, wherein the adjusting of the second joint media content further comprises simulating, by the processing system, that the first user is making contact with the object.
 6. The method of claim 1, further comprising providing, by the processing system, the adjusted second joint media content to the second display to cause presentation of the adjusted second joint media content on the second display.
 7. The method of claim 1, further comprising: adjusting, by the processing system, responsive to the detecting of the gesture input, the first joint media content resulting in adjusted first joint media content; and providing, by the processing system, the adjusted first joint media content to the first display to cause presentation of the adjusted first joint media content on the first display.
 8. A device, comprising: a processing system including a processor; and a memory storing executable instructions that, when executed by the processing system, perform operations comprising: obtaining a first image of a first user, a second image of a second user, and a plurality of images of an object, wherein the plurality of images of the object comprise a third image of the object that had been captured from a first camera angle, and a fourth image of the object that had been captured from a second camera angle, wherein the first camera angle is different from the second camera angle, and wherein the first user and the second user participate in a conference call; generating a fifth image of the object, wherein the fifth image is generated using interpolation, wherein the fifth image includes a portion of the object not included in the third and fourth images and wherein the fifth image is generated based upon a combination of the third image and the fourth image; producing first joint media content of the conference call that comprises the second image of the second user and a given one of the third and fourth images of the object; producing second joint media content of the conference call that comprises the first image of the first user and the given one of the third and fourth images of the object; providing the first joint media content to a first display associated with the first user to cause presentation of the first joint media content on the first display; providing the second joint media content to a second display associated with the second user to cause presentation of the second joint media content on the second display; detecting a user interaction from the first user directed towards the first joint media content presented on the first display; and adjusting, responsive to the detecting of the user interaction, the second joint media content resulting in adjusted second joint media content, wherein the adjusted second joint media content includes, instead of the given one of the third and fourth images of the object in the second joint media content, the fifth image of the object.
 9. The device of claim 8, wherein the user interaction is a gesture from the first user towards the first joint media content.
 10. The device of claim 8, wherein the detecting of the user interaction from the first user directed towards the first joint media content further comprises detecting a gesture simulating removing a selected portion of the object.
 11. The device of claim 10, wherein the adjusting of the second joint media content further comprises showing a remaining portion of the object in response to detecting the gesture simulating removing the selected portion of the object of the first joint media content.
 12. The device of claim 8, wherein the detecting of the user interaction from the first user directed towards the first joint media content further comprises detecting a gesture simulating rotating the object, and wherein inclusion of the fifth image in the adjusted second joint media content simulates rotation of the object.
 13. The device of claim 8, wherein the operations further comprise adjusting of the first joint media content resulting in adjusted first joint media content.
 14. The device of claim 13, wherein the adjusting of the first joint media content further comprises adjusting of the object.
 15. The device of claim 13, where the operations further comprise providing the adjusted first joint media content to the first display to cause presentation of the adjusted first joint media content on the first display.
 16. A non-transitory, machine-readable storage medium, comprising executable instructions that, when executed by a processing system including a processor, perform operations comprising: responsive to receiving a first image of a first user, a second image of a second user, and a third image of an object, wherein the first user and the second user participate in a conference call, generating a first joint media content of the conference call that comprises the second image of the second user and the third image of the object, and generating second joint media content of the conference call that comprises the first image of the first user and the third image of the object, wherein the third image of the object had been captured from a first camera angle; generating a fourth image of the object, wherein the fourth image is generated based upon the third image using extrapolation, wherein the fourth image includes a portion of the object not included in the third image and wherein the fourth image is associated with a second camera angle that is different from the first camera angle; providing the first joint media content to a first display associated with the first user to cause presentation of the first joint media content on the first display; providing the second joint media content to a second display associated with the second user to cause presentation of the second joint media content on the second display; detecting a user interaction from the first user directed towards first joint media content presented on the first display; and adjusting, responsive to the detecting of the user interaction, the second joint media content resulting in adjusted second joint media content, wherein the adjusted second joint media content includes, instead of the third image of the object in the second joint media content, the fourth image of the object.
 17. The non-transitory, machine-readable storage medium of claim 16, wherein the operations further comprise adjusting of the first joint media content resulting in adjusted first joint media content responsive to detecting the user interaction, wherein the adjusting of the first joint media content further comprises adjusting the object.
 18. The non-transitory, machine-readable storage medium of claim 17, where the operations further comprise providing the adjusted first joint media content to cause presentation of the adjusted first joint media content on the first display.
 19. The method of claim 1, wherein the fifth image is generated using extrapolation.
 20. The method of claim 1, wherein the fifth image is generated using interpolation.